A Confidence Interval for the Wallace Coefficient of Concordance and Its Application to Microbial Typing Methods
نویسندگان
چکیده
Very diverse research fields frequently deal with the analysis of multiple clustering results, which should imply an objective detection of overlaps and divergences between the formed groupings. The congruence between these multiple results can be quantified by clustering comparison measures such as the Wallace coefficient (W). Since the measured congruence is dependent on the particular sample taken from the population, there is variability in the estimated values relatively to those of the true population. In the present work we propose the use of a confidence interval (CI) to account for this variability when W is used. The CI analytical formula is derived assuming a Gaussian sampling distribution and recurring to the algebraic relationship between W and the Simpson's index of diversity. This relationship also allows the estimation of the expected Wallace value under the assumption of independence of classifications. We evaluated the CI performance using simulated and published microbial typing data sets. The simulations showed that the CI has the desired 95% coverage when the W is greater than 0.5. This behaviour is robust to changes in cluster number, cluster size distributions and sample size. The analysis of the published data sets demonstrated the usefulness of the new CI by objectively validating some of the previous interpretations, while showing that other conclusions lacked statistical support.
منابع مشابه
Typing of Campylobacter jejuni and Campylobacter coli isolated from live broilers and retail broiler meat by flaA-RFLP, MLST, PFGE and REP-PCR.
We analyzed 100 Campylobacter spp. isolates (C. jejuni and C. coli) from Grenada, Puerto Rico and Alabama, which were collected from live broilers or retail broiler meat. We analyzed these isolates with four molecular typing methods: restriction fragment length polymorphism of the flaA gene (flaA-RFLP), multilocus sequence typing (MLST), pulsed-field gel electrophoresis (PFGE), and automated re...
متن کاملSubtyping of Campylobacter jejuni by Using Capillary Electrophoresis
24 Campylobacter jejuni is a common cause of the frequently reported foodborne diseases in 25 the developed and developing nations. This manuscript describes the development of multiple26 locus variable number tandem repeat analysis (MLVA) using capillary electrophoresis as a novel 27 typing method for microbial source tracking and epidemiological investigation of C. jejuni. 28 Among 36 tandem ...
متن کاملA Review of Reservoir Rock Typing Methods in Carbonate Reservoirs: Relation between Geological, Seismic, and Reservoir Rock Types
Carbonate reservoirs rock typing plays a pivotal role in the construction of reservoir static models and volumetric calculations. The procedure for rock type determination starts with the determination of depositional and diagenetic rock types through petrographic studies of the thin sections prepared from core plugs and cuttings. In the second step of rock typing study, electrofacies are deter...
متن کاملConstructing a Confidence Interval for Quantiles of Normal Distribution, one and Two Population
In this paper, in order to establish a confidence interval (general and shortest) for quantiles of normal distribution in the case of one population, we present a pivotal quantity that has non-central t distribution. In the case of two independent normal populations, we construct a confidence interval for the difference quantiles based on the generalized pivotal quantity and introduce ...
متن کاملBayes Interval Estimation on the Parameters of the Weibull Distribution for Complete and Censored Tests
A method for constructing confidence intervals on parameters of a continuous probability distribution is developed in this paper. The objective is to present a model for an uncertainty represented by parameters of a probability density function. As an application, confidence intervals for the two parameters of the Weibull distribution along with their joint confidence interval are derived. The...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- PLoS ONE
دوره 3 شماره
صفحات -
تاریخ انتشار 2008